Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Abstract

Social background profiling of speakers refers to estimating their geographical origin from their speech features. Methods for accent classification that use linguistic features require phoneme alignment and transcription of the speech samples. This paper proposes a purely acoustic model, composed of multiple convolutional networks with global average-pooling layers, to classify the temporal sequence [...]. The bottleneck representations of the networks, trained on the original signals and their low-pass filtered copies, are fed to a Support Vector Machine classifier for the final prediction. The model has been analysed on a dataset of Indian speakers from social backgrounds spread across India. It is shown that up to 85% accuracy is achievable in classifying the geographic origin corresponding to regional languages; this is 17% higher than a benchmark deep learning model using the same [...]. Results have also indicated that classification of accents is easier in the speakers' second language, as compared to their native language.
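The feature path described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The moving-average filter, the random untrained kernels, and all function names (`low_pass`, `conv1d_relu`, `bottleneck`, `combined_features`) are assumptions made for illustration; the abstract does not specify the filter design, the network sizes, or the training procedure. The sketch only shows why global average pooling yields a fixed-length vector for clips of any duration, and how vectors from the original and low-pass filtered signals could be concatenated before a classifier such as an SVM.

```python
import math
import random


def low_pass(signal, width=5):
    """Moving-average low-pass filter (assumed stand-in for the paper's filter)."""
    half = width // 2
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def conv1d_relu(signal, kernel):
    """Valid 1-D convolution (cross-correlation form) followed by ReLU."""
    k = len(kernel)
    return [
        max(0.0, sum(signal[i + j] * kernel[j] for j in range(k)))
        for i in range(len(signal) - k + 1)
    ]


def global_average_pool(feature_map):
    """Collapse the time axis to a single value."""
    return sum(feature_map) / len(feature_map)


def bottleneck(signal, kernels):
    """One pooled value per kernel, so the length is independent of clip duration."""
    return [global_average_pool(conv1d_relu(signal, k)) for k in kernels]


def combined_features(signal, kernels_orig, kernels_lp):
    """Concatenate features from the original signal and its low-pass copy."""
    return bottleneck(signal, kernels_orig) + bottleneck(low_pass(signal), kernels_lp)


# Hypothetical, untrained kernels: a real system would learn these weights.
random.seed(0)
kernels_a = [[random.uniform(-1, 1) for _ in range(7)] for _ in range(4)]
kernels_b = [[random.uniform(-1, 1) for _ in range(7)] for _ in range(4)]

# Two synthetic clips of different durations.
short_clip = [math.sin(0.3 * t) for t in range(200)]
long_clip = [math.sin(0.3 * t) for t in range(500)]

features_short = combined_features(short_clip, kernels_a, kernels_b)
features_long = combined_features(long_clip, kernels_a, kernels_b)
```

Both clips map to vectors of the same length (here 8), which is what allows a fixed-input classifier to consume them. The classifier stage itself is omitted from the sketch.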

Similar resources

Estimating number of speakers by the modulation characteristics of speech

A method for estimating number of speakers of mixed speech signals was proposed. The algorithm was based on the modulation characteristics of speech, specifically that a single speech utterance typically has a distinct modulation pattern with a peak around 4-5 Hz. Having observed that the modulation peak decreases as number of speakers increases, our estimation algorithm used the region of the ...

The Effects of Speech Rate, Prosodic Features, and Blurred Speech on Iranian EFL Learners' Listening Comprehension

Keywords: effect of speech rate on listening comprehension, blurred speech, segmental and suprasegmental features, authentic speech, intelligibility, discrimination, omission, assimilation. Abstract: The rate of listening material in connected speech has generally always been a nightmare for second-language learners, and especially for Iranian listeners. Despite the common-sense view that, with slower speech, listening comprehension activities ...

Analysis of Acoustic Features in Speakers with Cognitive Disorders and Speech Impairments

This work presents the results in the analysis of the acoustic features (formants and the three suprasegmental features: tone, intensity and duration) of the vowel production in a group of 14 young speakers suffering different kinds of speech impairments due to physical and cognitive disorders. A corpus with unimpaired children’s speech is used to determine the reference values for these featur...

Acoustic features of vowel production in Mandarin speakers of English

English vowel productions were acoustically examined in a group of native Mandarin speakers. The first and second formant frequencies (F1 & F2) of 11 English vowels were examined in the syllable-level productions of 40 Mandarin speakers compared to 40 American English speakers. Results of the comparative acoustic analysis indicated that the Mandarin speakers differed significantly from the Amer...

An Introduction to Speech Sciences (Acoustic Analysis of Speech)

Speech sciences deal with the acoustical characteristics of speech by means of sophisticated software as well as hardware. Although speech science is well known in developed countries, especially Western societies, it has remained almost unknown in Iran, though in recent years a group of scholars has become involved in this branch of science. The applicati...

Journal

Journal title: Journal of Scientific & Industrial Research

Year: 2023

ISSN: 0022-4456

DOI: https://doi.org/10.56042/jsir.v82i08.3122